ERGODIC UNIVERSALITY OF SOME TOPOLOGICAL 
DYNAMICAL SYSTEMS 



ANTHONY QUAS AND TERRY SOO 

Abstract. The Krieger generator theorem says that every invert- 
ed ible ergodic measure-preserving system with finite measure-theor- 
etic entropy can be embedded into a full shift with strictly greater 
topological entropy. We extend Krieger's theorem to include toral 
automorphisms, and more generally, any topological dynamical 
system on a compact metric space that satisfies almost weak spec- 
ification, asymptotic entropy expansiveness, and the small bound- 
ary property. As a corollary, one obtains a complete solution to a 
natural generalization of an open problem in Halmos's 1956 book 
regarding an isomorphism invariant that he proposed. 



1. Introduction 

Let S be a self-homeomorphism of a compact metric space (Y, d). Let 
T be an invertible ergodic measure-preserving transformation on a non- 
atomic probability space (fl, a). An embedding of (fl, /i, T) into (Y, S) 
is a measurable mapping \1/ : Q — > Y such that the restriction of $ to a 
set of full measure fl' is an injection, and \P(T(u;)) = S for all u> G 

Q'. We say that the topological dynamical system (Y, S) is universal 
if for every invertible ergodic measure-preserving system (X, u, T) with 
measure-theoretic entropy strictly less than the topological entropy of 
S there exists an embedding of T) into (Y, S), and we say that 
(Y, S) is fully universal if the embedding can be chosen so that the 
push-forward of the measure on Q is fully supported on Y. The Krieger 
finite generator theorem [501 EI] says that the full shift on a finite 
number of symbols is universal. We prove the following extension of 
Krieger's theorem. 

Theorem 1. Toral automorphisms are universal. 
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Following Lind [36], we say that a toral automorphism is quasi- 
hyperbolic if its associated matrix A has no roots of unity as eigen- 
values, and hyperbolic if A does not have an eigenvalue of modu- 
lus 1. Lind and Thouvenot [37J Section 5] proved that hyperbolic 
(two-dimensional) toral automorphisms are fully universal and asked 
whether the same is true in the quasi-hyperbolic case, and in the more 
general case of an automorphism of a compact group. Lind and Thou- 
venot made use of the fact that hyperbolic toral automorphisms can 
be represented as irreducible shifts of finite type; this is not true in 
the non- hyperbolic case [35J Section 6], [35J Theorem 4]. We prove an 
affirmative answer to their question, which easily implies Theorem [TJ 

Theorem 2. A quasi- hyperbolic toral automorphism is fully universal. 

Theorem [2] will be proved as part of a more general result which we 
state in Section [21 The result has three conditions, one of which is a 
form of specification. 

Let S be a self-homeomorphism of a compact metric space (Y,d). 
Suppose that for every e > there exists a function L e : Z + — > Z + such 
that given a finite number of points yi, . . . , y n G Y and finite sequence 
of integers ai < bi < a 2 < b 2 ■ ■ ■ a n < b n with <2j — > L e (bi — cij) for 
all i G [2, n], there is a y G Y with d(S k y, S k yi) < e for all k G [a i: bi] 
and i G We call L £ a gap function. If the gap function L e 

satisfies L e (m)/m — > as m — > oo, then we say that S satisfies almost 
weak specification; if the function L £ is a constant function, then S 
satisfies weak specification, and in addition, if y can be chosen to 
be a periodic point, then S satisfies specification (for background see 

HE31EDEI]). 

Marcus [5H] proved that quasi-hyperbolic toral automorphisms sat- 
isfy almost weak specification. Let us remark that a toral automor- 
phism is ergodic with respect to Haar measure if and only if it is quasi- 
hyperbolic [21]. 

Conjecture 1. A self-homeomorphism with almost weak specification 
on a compact space is (fully) universal. 

Recently, we also proved that the time-one map of a geodesic flow 
on a compact surface of negative curvature is universal [13] . With the 
help of the symbolic dynamics for geodesic flows developed by Bowen 
[6] and Ratner [16] , it was sufficient to show that the time-one map of a 
topologically weak-mixing suspension flow over an irreducible subshift 
of finite type is universal; in our proof of this result we were aided 
by the symbolic nature of the suspension flow and the fact that the 
time-one map satisfies (weak) specification [15J Proposition 5]. 



ERGODIC UNIVERSALITY 



3 



Krieger also proved that mixing subshifts of finite type are universal 
[55] (see also the proof given by Denker [TU Theorem 28.1]). We also 
prove the following generalization of Krieger's theorem, and Lind and 
Thouvenot's result [371 Theorem 2] that every mixing subshift of finite 
type is fully universal. 

Theorem 3. A subshift with almost weak specification on a finite num- 
ber of symbols is fully universal. 

We will prove a weaker version of Conjecture [IJ Theorem [7J from 
which Theorems [2] and [3J will follow. We will require some additional 
conditions that are satisfied in Theorems [2] and [3j We will give the 
precise statement of these additional conditions in the next section. 

In our proof of Theorem [7J we make use of an idea of Burton and 
Rothstein [TTJ, further developed in work of Burton, Keane and Serafin 
[TP] , and Downarowicz and Serafin [T5] that involves producing the 
required injections using the Baire Category theorem. 

In Section [5J we will prove Theorems [2] and [5J using our more general 
result, and apply Theorem[3jto show that /3-shifts are universal. We will 
also generalize Theorem [2] to include any automorphism of a compact 
metric abelian group that is ergodic with respect to Haar measure. In 
Section H] we set the stage for the proof of our more general result; the 
proof will be carried out in the remaining sections. 

We would like to thank Jean-Paul Thouvenot for introducing us to 
the problem. 

2. Non-expansive homeomorphisms and the small boundary 

condition 

We recall in this section some basic tools to deal with non-expansive 
homeomorphisms. Let (X, d) denote a compact metric space. For 
r > 0, we let B(x,r) denote the open ball about a point x G X, and 
for a subset A C X, we let B(A,r) := \J xeA B(x, r). If V is a finite 
(Borel-)measurable partition of X its diameter is defined to be the 
maximum of the diameters of the elements of the partition. The set dV 
is the union of the topological boundaries of the elements of V and d r V 
denotes B(dV,r). We recall that a partition is said to be generating 
if for any distinct pair of points, x and y, there exists an n G Z such 
that T n x and T n y lie in different elements of V. 

A self-homeomorphism T of a metric space (X, d) is expansive if 
there exists a 5 > (the expansiveness constant) such that for all 
x, y G X if d(T n x, T n y) < 5 for all n G Z, then x = y. Expansive 
homeomorphisms have many generating partitions: Indeed any parti- 
tion of diameter less than the expansiveness constant is generating. We 
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recall that any subshift (that is the restriction of the shift map to a 
non-empty closed shift-invariant subset of the full shift) is expansive. 

We write Mt{X) for the collection of T-invariant Borel probability 
measures on X. Another desirable feature of expansive homeomor- 
phisms is that the entropy functional sending an invariant measure to 
its measure-theoretic entropy is upper semi-continuous with respect to 
the weak* topology (see Lemma [TH for an explicit metric which gen- 
erates the weak* topology). In order to consider non-symbolic spaces 
we will make use of two successive weakenings due to Bowen [5] and 
Misiurewicz [41 J which allow us to recover the upper semi-continuity of 
the entropy functional. 

For each 5 > 0, define the set 

T 5 (x) :={yeX: d(T j x,T j y) < 5 for all j > 0}. 

If there is a constant 5 > such that ht op {T, Ts(x)) = for all x G X, 
then T is entropy expansive, and if sup xGX ht op (T, F$ (x) ) — > as 
8 — > 0, then T is asymptotically entropy expansive. 

That these are strict weakenings of expansiveness is seen by simple 
examples such as the identity map, twist maps such as T(x, y) = (x, y+ 
x) mod 1. More substantial examples are given by the following lemma. 

Lemma 4 (Bowen [5J Example 1.2). Toral automorphisms are entropy 
expansive. 

Thus while hyperbolic toral automorphisms are expansive, in general 
toral automorphisms are only entropy expansive. 

Lemma 5 (Misiurewicz [JT] Corollary 4.1). LetT be an asymptotically 
entropy expansive self-homeomorphism of a compact metric space X . 
Then the entropy functional \i h-> h^(T) is upper semi- continuous with 
respect to the weak* topology on Mt(X). 

Bowen proved Lemma [5] in the case where T is entropy expansive. 
For an example of a map that satisfies specification, but is not asymp- 
totically entropy expansive and does not have an upper semi-continuous 
entropy functional see [1U Page 952]. 

Another property that we need is sometimes called the small bound- 
ary property or the existence of an essential partition. A metric space 
is d- dimensional (i.e. it has topological dimension d) if there ex- 
ists an open cover with arbitrarily fine diameter with the property 
that the intersection of any d + 2 distinct elements is empty. A self- 
homeomorphism, T, of a compact metric space X is said to have the 
small boundary property if for each 5 > 0, there exists a partition 
V of X such that diam('P) < 5 and a{dP) = for each T-invariant 
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measure //. Notice that if [i{dP) = 0, then /i(<9(T = 0. Also if 

fi(&p) = and /i(dQ) = 0, then fj,(d(V V Q)) = 0. 

Both the small boundary property and asymptotic entropy expan- 
siveness are important properties in the theory of symbolic extensions 
and entropy structure; for more information see the recent book of 
Downarowicz [T7] and the recent articles of Boyle and Downarowicz 
[7], Burguet [9], Downarowicz [161 UHL an d Lindenstrauss [38]. We 
make use of the following result of Kulesza. 

Lemma 6 (Kulesza [34J Lemma 3.7). A self-homeomorphism of a com- 
pact finite- dimensional metric space with the property that the periodic 
points form a zero- dimensional set has the small boundary property. 

Krieger [33J raised the question of when a homeomorphism of a com- 
pact metric space is universal. We have the following partial answer. 

Theorem 7. A self-homeomorphism of a compact metric space is fully 
universal whenever if satisfies: 

(1) almost weak specification; 

(2) asymptotic entropy expansiveness; and 

(3) the small boundary property. 

Notice that Theorem [3] follows immediately from Theorem [7J 

3. Examples 

3.1. Subshifts. We call a subshift non-trivial if it does not consist of 
a finite set of points. Subshifts have a natural generating partition given 
by V := {Pi, . . . P n }, where Pi = {x G X : xo = i} for all 1 < i < n. 

Familiar examples of subshifts that satisfy specification are mixing 
subshifts of finite type, and mixing sofic subshifts. However, there are 
many subshifts that do not satisfy almost weak specification, but are 
still universal. 

Corollary 8. // a subshift contains subshifts that satisfy almost weak 
specification and have topological entropy arbitrarily close to the origi- 
nal topological entropy, then it is universal. 

Proof. Immediate from Theorem [3] and the definition of universality. 

□ 

The /3-shifts were introduced by Renyi [UJ and can be defined as 
follows. For p > 1, let T be the full-shift on = E := {0, 1, ... , [f3\ } 
and let ^ denote the lexicographic order on S z and S z+ . Let uj G S z+ 
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be the locally greedy /3-expansion of 1; that is if a G £ is a solution 
to the equation 

oo 
3=1 H 

then a ^ u. The /3-shift is the subshift of the full-shift such that x G S z 
belongs to the subshift if and only if (T k x)°l l ^ u for all k > 1. 

Schmeling [50, Theorem B, E] proved that the set of all of j3 G 
(l,oo) such that /3-shift does not satisfy specification is residual and 
its symmetric difference with (1, oo) is a set of measure zero; see [TJ [2] 
for background. 

Proposition 9. For each (3 > 1 the (3-shift is universal. 

Let L be the set of all allowed (finite) words in the subshift X, so 
that C(X) = C := [J n C n , where for n > 1 we define 

C n := {xi ■ ■ -x n : x G X} . 

A subshift generated by free concatenations of a finite set of words is 
often called a renewal system. Renewal systems are sofic [56J. 

Proof. Climenhaga and Thompson (121 Section 3.1] show that for /3- 
shifts, by taking 

Qn ■= {gi ■ ■ ■ 9n G C n ■ Qn = and UJ n ^ 0} , 

we have that 

limsup - log | Q n | = log(/3) = h top (T); 

n— >oo n 

and furthermore, given any k words in Q n can be concatenated to pro- 
duce an element of Q\. n . Thus the subshift X n generated by Q n is a 
mixing sofic renewal system. Clearly, the sequence of subshifts X n has 
topological entropy arbitrary close to log(/3). Thus /3-shifts are univer- 
sal by Corollary [HI □ 

3.2. Toral automorphisms. Lind [35| Section 3] classifies hyperbolic 
and quasi-hyperbolic toral automorphisms into three disjoint classes 
according to the spectral properties of the associated matrix. These 
classes correspond precisely to the various forms of specification that 
were defined in Section [TJ Recall that associated matrix for a hyper- 
bolic toral automorphism has no roots of unity, and hence all quasi- 
hyperbolic toral automorphisms in dimensions two or three are hyper- 
bolic. Bowen [1] proved that hyperbolic toral automorphisms satisfy 
specification. The associated matrix for a central spin automorphism 
has some eigenvalues on the unit circle, and the Jordan blocks for these 
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eigenvalues have no off-diagonal l's, and all other quasi-hyperbolic toral 
automorphisms are central skew automorphisms. Lind J351 Theo- 
rem (ii)] proved that central spin automorphisms (which only occur 
in dimensions 4 or higher) satisfy weak specification, but never satisfy 
specification. Lind [35J Theorem (iii)] also proved that central skew au- 
tomorphisms (which only occur in dimensions 8 or higher) never satisfy 
weak specification, but nevertheless Marcus [39] proved that they still 
must satisfy almost weak specification. It is easy to give explicit exam- 
ples of all the different types of quasi-hyperbolic toral automorphisms 
(see for example [36, Section 3]). 

Proof of Theorem^ By Lemma [T2| we know that toral automorphisms 
are entropy expansive. Since we assume quasi-hyperbolicity, almost 
weak specification is satisfied [3H] , and furthermore an easy linear alge- 
bra argument shows the periodic points are exactly the points on the 
torus with rational coordinates; it is an elementary fact [2"Tj Proposi- 
tion 1.2.4] that countable sets have zero-dimension. Thus Theorem [2] 
follows from Theorem [7] and Lemma [61 □ 

We will use the following elementary factorization lemma to prove 
Theorem [TJ 

Lemma 10. Any toral automorphism is the direct product of a quasi- 
hyperbolic toral automorphism and a toral automorphism of zero topo- 
logical entropy. 

Proof. The lemma is a consequence of the following facts. The topo- 
logical entropy of a toral automorphism is given by the sum of the 
logarithms of the moduli of the associated eigenvalues of modulus at 
least one (see for example [3:, Corollary 16]). A toral automorphism 
with a characteristic polynomial / that can be expressed as a product 
f — gh such that g and h are polynomials over Z with gcd(g, h) — 1 is 
the direct product of toral automorphisms with characteristic polyno- 
mials g and h. A polynomial over Z with a root of unity is the product 
of a polynomial over Z with only roots of unity and a polynomial over 
Z without roots of unity. □ 

Proof of Theorem [TJ Let S be an automorphism of the d- dimensional 
torus T d . By Lemma [TD] let S = S q xS°, where S q is a quasi-hyperbolic 
toral automorphism of a c?i-dimensional torus T dl with the same topo- 
logical entropy as S, and S° is a zero entropy toral automorphism of 
a g?2- dimensional torus T d2 . Note that if T is an invertible ergodic- 
measure preserving transformation of a probability space (fi, /i) and 
\1> is an embedding of (Q, /i, T) into (T d , S q ), then the map ^'(y, z) := 
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O(y), 0) gives an embedding of (X, //, T) into (T dl x T d \S q x S°), since 
is a fixed point of any toral automorphism. Thus the universality of 
(T d , S) follows from the universality of (T dl , S q ) and Theorem [2j □ 

We give the following easy application of Theorem [TJ Let T be a 
C 1+s (5 > 0) diffeomorphism of a compact smooth manifold. Let 

E(T) = {h^(T) : fi is an ergodic invariant measure for T} . 

Katok conjectures E(T) D [0, /i top (T)) and proved this to be true in the 
case where T is a diffeomorphism of a two-dimensional surface [55j [27] ; 
the proof follows immediately from [27| Theorem 4]. Recently, Sun [55] 
showed that E(T) is dense in [0, h top (T)] for a toral automorphism T; 
Theorem [1] extends the result of Sun and implies that Katok's conjec- 
ture is true for toral automorphisms. 

3.3. Automorphisms of a compact metric abelian group. Toral 
automorphisms form an important class of examples of automorphisms 
of compact metric abelian groups. Note that an automorphism always 
preserves the Haar measure. Generalizing the earlier result of Marcus 
[39] , Dateyama [13] proved that an automorphism of a compact metric 
abelian group satisfies almost weak specification if and only if it is 
ergodic with respect to the Haar measure. 

Ergodicity in the setting of an automorphism S of a compact metric 
abelian group Y still has an algebraic characterization. Recall that the 
character group of Y is the set of all continuous group homomorphisms 
of Y into the complex unit circle with multiplication and denoted by Y. 
Let Us : L 2 — >■ L 2 be the Koopman representation given by Ug(x) = 
X ° S, where L 2 is the set of all square integrable functions on Y with 
respect to Haar measure. Since S is also a group homomorphism, we 
have that U$ '■ Y — > Y. Halmos [24J proved that S is ergodic with 
respect to Haar measure if and only if U$ has no finite orbits other 
than the trivial character; that is for all x £ Y an d all n G Z + , we have 
Ugix) = X if and only if x = 1. 

Theorem 11. An automorphism of a compact finite-dimensional met- 
ric abelian group with finite topological entropy whose Koopman rep- 
resentation has no finite orbits on the character group, other than the 
trivial character, satisfies almost weak specification and is universal. 

The proof of Theorem ITT1 is similar to the proof of Theorem [2] making 
use of slightly more technical tools to verify the conditions of Theorem 
[7J The following lemma substitutes entropy expansiveness with asymp- 
totic entropy expansiveness. 
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Lemma 12 (Misiurewicz jH] Example 7.1). A self-homomorphism of a 
compact group with finite topological entropy is asymptotically entropy 
expansive. 

Finally, we also need to argue that the periodic points are countable. 
We do not have an elementary argument for this, and instead will refer 
to a recent paper of Miles [40] who gives a formula for the number of 
periodic points of each finite order of a ergodic finite entropy automor- 
phism of a finite-dimensional compact abelian group; we will not need 
to use the full force of this formula. 

Proof of Theorem [UJ By Lemma [£21 we have asymptotic entropy ex- 
pansiveness. Since the Koopman representation has no non-trivial fi- 
nite orbits, we have that the automorphism is ergodic with respect to 
Haar measure [21] and thus satisfies almost weak specification [T3] . 
Since the automorphism is ergodic and has finite entropy, by [4TJ[ 
Lemma 4.3], the number periodic points of each finite order is finite 
thus the number of periodic points is countable and has zero-dimension 
[21] Proposition 1.2.4]. Theorem [Til now follows from Theorem [7] and 
Lemma El □ 

In his book [25], Halmos proposed an invariant of two-dimensional 
toral automorphisms. He noted that if / : T 2 — > T is one of the coordi- 
nate functions then p(T)f = 0, where p is the characteristic polynomial 
of the automorphism, even though the function / is a non-constant 
function mapping into T. Here, by p(T), if p{t) = J2 a kt k £ we 
mean p(T)f = Y2k a kf ° T k . Clearly the existence of a non-constant 
T- valued / such that p{T)f = almost everywhere is an isomorphism 
invariant. Halmos asked whether this invariant (with the polynomials p 
taken to be of the formp(t) = t 2 — at + 1) distinguishes two-dimensional 
toral automorphisms with distinct characteristic polynomials. A natu- 
ral generalization is to set F(X,T,p) := {p G Z[£] : p(t) is irreducible 
over Z and there exists a non-constant T-valued / such that p(T)f = 0, 
/z-a.e.}. Notice that for a reducible polynomial q(t) G Z[t], there is a 
non-constant T-valued / such that q{T)f = /i-a.e. if and only if one 
of its irreducible factors belongs to F(X, T, p). 

The following corollary establishes that ¥(X, T, p) is no stronger an 
invariant than the rational spectrum (in the sense of its ability to dis- 
tinguish transformations). We would like to thank Benjy Weiss for 
pointing this corollary out to us, and for suggesting we include it in 
this paper. 
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Corollary 13 (Weiss). Let (X, T, u) and (X',T', //) be ergodic in- 
vertible measure-preserving transformations of non-atomic probability 
spaces. IfT andT' have the same rational spectrum, thenF(X,T, //) = 

ppr,TV). 

Proof. Let NCI denote the collection of non-cyclotomic irreducible 
polynomials in Z[£]. If (X, T, a) is non-trivial and p(t) G XCJ \ 
{±l,±t}, then let T z be equipped with the shift map a. Let J p = 
{z G T z : p{o~)z = 0}. This satisfies the conditions of Theorem [TT| 
so that provided (X,T,fx) has entropy smaller than ht op (I p ,a), there 
is a non-constant equivariant map tt: (X, //) — > (I p ,o~). If h^(T) > 
htop{Ip, o~), then we first take a two-element partition V of X such that 
H(V) < h top (I p ,a) and consider the factor of X induced by V. This 
has strictly smaller entropy than h top (I p ,a), which is then embedded 
as before. In either case, we end up with a non-constant equivariant 
map 7r from (X, T, u) to (I p , a). Taking f(z) = z , we see p(T)f on = 
for /i-a.e. x, so that p G P(X, T, /i). 

For a positive integer m, let S denote the transformation S(t) = 
t + l mod m of [m] = {0, . . . , m — 1} and c denote normalized counting 
measure on [m]. We denote the nth cyclotomic polynomial by $„. If 
£ > 1 is a factor of m, then one can check that <3?£ G P([m], 5*, c) (as 
witnessed by the function g(x) = cos(27r(x — a)/£) for a irrational - 
notice that the a guarantees that g is not constant for £ > 1). However, 
P([ra],S, c) may contain $ n 's for integers n that are not factors of 
m also. For example the function g(0) = |, g(l) = | shows that 
$ 6 eP([2],£,c). 

For a non-trivial invertible ergodic measure-preserving transforma- 
tion (X,T,fi), let -R(T) := {m: T has an m-point factor}. We claim 
the following, from which the statement of the corollary holds. 

F(X,T,/i) = (NCI\{±l,±t})U |J P([m],5,c). 

iVC/ \ {±l,±t} is contained in both sides by the above, so that it 
suffices to show that the two sides contain the same cyclotomic poly- 
nomials. That ¥([m], S,c) is contained in P(X, T, «) for m G R(T) is 
clear by lifting a function defined on [m] to X through the factor map. 
For the converse, suppose that $„ G P(X, T, u). Let /: X — > T be 
non-constant but be such that $ n (T)/ = 0. Since $ n (T)/ = and 
$ n (x)|x n - 1, we see that (T n - 1)/ = 0, so that / o T n {x) = f{x) fi- 
a.e. Let m be the period of the sequence (/(T J 'x))j, so that m G R(T). 
Then (X, T, u) factors onto [m] and transferring the function / to [m] 
shows that <3> n G P([m], S", c) as required. □ 
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3.4. Geodesic flows and suspension flows. The small boundary 
property is a technical condition that is convenient for our method of 
proof, but is not necessary for universality. The time-one map of a 
geodesic flow on a compact surface of negative curvature is entropy 
expansive [5j Example 1.6 *] and time- map of a suspension flow can 
also be shown to be entropy expansive by [5, Example 1.6]. The time- 
one map of a geodesic flow of a compact surface of negative curvature 
satisfies specification [5U (Example) F] and the time-one map of a topo- 
logically weak-mixing suspension flow satisfies (weak) specification [4"5] 
Proposition 5]. However, clearly the time-one maps of these flows may 
have uncountably many periodic points, and moreover, it is not dif- 
ficult to construct examples of topologically weak-mixing suspensions 
flows that do not have the small boundary property. In spite of this, 
we proved universality for these flows [45, Theorems 1 and 2]. 

3.5. Non-examples. We give an example to show that it is too much 
to ask that the strict entropy difference in the definition of universality 
can be relaxed. The same example also shows that universality can not 
always be satisfied; this fact also easily follows from the Krieger-Jewett 
theorem [32] . 

Let S be the full-shift on Q := {0, 1} endowed with the Bernoulli 
measure ( that is the unique measure of maximal entropy-log(2). Let 
5" be an irrational rotation of the circle T 1 . The invertible measure- 
preserving transformation S x 5" is ergodic and has entropy log(2). The 
invertible measure-preserving system (Q x T 1 , S x S') can not be embed- 
ded within (fl, S), since the embedding would yield a measure-theoretic 
isomorphism of S x S' and S; this implies that S has a non-trivial zero 
entropy factor which contradicts that fact that S is a i^-automorphism 
[531 Part 1, Paper 2]. Similarly, if the topological dynamical system 
(fixT^Sx S') were universal, then every measure-preserving automor- 
phism with entropy strictly less than log(2) would have a non-trivial 
zero entropy factor. 

4. Soft methods 

4.1. Basic idea. Roughly, the idea of Burton and Rothstein [JTj, is 
that one defines an notion of e-approximate embedding (see Proposition 
US]) with the property that a point lying in the intersection of the e- 
approximate embeddings is a true embedding. One then introduces 
a Polish space (a separable completely metrizable topological space) 
consisting of potential injections (these are in fact joinings, see Lemma 
ITT)) and shows that the e-approximate embeddings form a dense open 
subset of the set of potential embeddings. Baire's theorem gives the 
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desired result. The essential (and surprising) feature of this idea is that 
in a topological sense, almost any candidate works. Burton, Keane and 
Serafin reproved the Krieger generator theorem [30], the Sinai factor 
theorem [52l 153] , and the Ornstein isomorphism theorem |42j . 

Keane and Smorodinsky [28j 129] gave an explicit proof of the Orn- 
stein isomorphism theorem. Similarly, there is an alternative approach 
to producing embeddings that we used in our earlier paper [35J. In 
that approach, one carefully produces a sequence of approximate em- 
beddings, takes a pointwise limit, and proves that this limit has the 
desired properties. (However, our construction does not yield a Uni- 
tary map, unlike those of Keane and Smorodinsky.) 

4.2. Joinings and Baire category. In this subsection, we make pre- 
cise the ideas outlined above. We will make use of the following explicit 
metric giving rise to the weak*-topology on measures. Let Z be a com- 
pact metric space. We let Lip 1 (Z) denote the space of all real- valued 
Lipschitz continuous functions on Z taking values in [0, 1] with Lips- 
chitz constant no greater than 1. Let Ai(Z) denote the collection of all 
Borel probability measures on Z. Write fi(f) := f fd\i for // G Ai(Z) 
and an integrable function / : Z — > R. 

Lemma 14 (Distance giving weak*-topology on measures). Let Z be 
a compact metric space. Define a metric d on Ai{Z) as follows: 

d*(n,v) = sup \fj,(f) - u(f)\ . 
/eLi Pl 

The metric d* gives rise to the weak* topology on Ai(Z). 

The proof is a simple adaptation of the proof in Dudley's book ([20J, 
Theorem 11.3.3) to take account of the fact that functions in Lip x are 
required to take values in [0, 1]. 

In general, a measure-preserving transformation is defined on a mea- 
sure space. In order to use the Baire category machinery, we will need 
the space to be embedded to be a metric space. The following lemma 
allows us to assume that the transformation to be embedded lives on 
a metric space. 

Lemma 15. Let S be a self-homeomorphism of a compact metric space 
Y. Then (Y, S) is (fully) universal if and only if there exists an embed- 
ding of every non-trivial ergodic subshift (X, T, /i) with h^(T) < h top (S) 
into (Y, S) (such that the push-forward of fi is fully supported on Y). 

Proof. The only if part is clear from the definition of universality. For 
the converse by the Krieger generator theorem, for an arbitrary in- 
vertible measure-preserving transformation Tq of the probability space 
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(X , /i ), there is a measure-theoretic isomorphism between (X , T , /io) 
and a subshift (X, T, /i) (which, of course, preserves the entropy). By 
the assumption of the lemma, this subshift may be embedded into 
(Y,S). Composing the isomorphism and the embedding gives the re- 



Thus by Lemma [151 we ma y always assume that the space to be 
embedded is a subshift on a finite number of symbols. 

Let (X, J 7 , fi, T) be a non-trivial ergodic subshift with invariant mea- 
sure /x and Borel u-algebra J 7 . Let S be a self-homeomorphism of a 
compact metric space Y with Borel a- algebra B. By a fi- joining, 
we mean a (T x 5')-invariant measure on the product metric space 
X x Y (with the product a-algebra T <g) B)) whose X- marginal is \i. 
(Note that unlike the standard definition of a joining, we do not make 
any requirement on the ^-marginal; for more background on joinings 
see @H [221 HE]-) We let J^(T,S) denote the space of /x-joinings. It 
is well known that J^(T,S) is a (non-empty) compact metric (hence 
complete) space with the weak* topology. We let ~K\ and 7r 2 be the co- 
ordinate projections from X x Y to X and Y, respectively, and denote 
the push- forward maps by 7r* and 71%, so that for £ G J^T, S), we have 
7r*(£) = /j. Before we prove topological properties regarding subsets of 
the space of joinings, we first state an elementary but useful fact which 
motivates how joinings are related to embeddings. 

Let (Z, Q, £, U) be a measure-preserving system. Let J 7 , B C Q be 
sub-cr-algebras. We write TcJB mod £ if for each F 6 J, there is a 
B £ B such that the ^-measure of the symmetric difference is zero. 



Similarly, for each e > 0, we write J 7 C B mod £, if for each F G J 7 , 
there is a 5 G £> such that the ^-measure of the symmetric difference 
is strictly less than e. For a partition V, we denote by <r(V) the finite 
ex-algebra that it generates. Note that V is a generating partition for 

Z (mod if and only if Q C V/ Je z mod f ■ Also let r z = { > Z ) 

denote the trivial a-algebra on Z. 

Proposition 16. Let (X, J 7 , /j, T) 6e a non-trivial ergodic subshift with 
invariant measure fi and Borel a-algebra J 7 . Let S be a self-homeo- 
morphism of a compact metric space Y with Borel a-algebra B. Let £ 
be a ^-joining satisfying the following conditions 

(i) T x ® B C J 7 ® T Y mod £; 

(ii) T ®T Y CT X <g> B mod £. 

TTien t/iere exists an embedding \I> : X — )■ F o/ (X, T, /i) into (Y, S) 



suit. 



□ 



£(jr x 5) = ^(jrn /or a// (F, B) e J 7 x B. 



(1) 
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Similarly, if ^ is an embedding of (X,T, fi) into (Y,S), then £ defined 
by ([T]) satisfies the first two conditions. 



Thus if £ satisfies the two containment conditions in Proposition [T6l 
we also say that £ is an embedding. The Burton-Rothstein argument, 
that we follow, defines approximate embeddings and works by showing 
that there are a large collection of approximate embeddings at each 
scale. 

Lemma 17 (The Baire Space). Let (X, /x, T) be a non-trivial ergodic 
subshift with invariant measure \l, and let S be a self-homeomorphism 
of a compact metric space Y that is asymptotically entropy expansive. 
Let h^(T) < h top (S). The space defined by 



is a Baire space. 

If U is an invertible measure-preserving transformation on a space 
Z, and / : Z — > R is any real- valued function, we let A^(/) denote the 
Cesaro average given by 



Let \i be a [/-invariant measure on Z. By the Birkhoff ergodic theorem, 
fi is ergodic if for any / G L , Aq (/) converges in measure to a constant. 
In the case that Z is a compact metric space, then we say that x G Z 
is generic (for /i) if for every continuous function / : Z — > R, we have 
that Aq(/)(x) — > fi(f) asn-f oo. An elementary argument gives that 
if fi is ergodic if and only if /j-a.e. x G Z is generic for fi. 

Proof of Lemma [7?| Since S is assumed to be asymptotically entropy 
expansive, by Lemma[5]the entropy functional is upper-semicontinuous. 
Thus the subset of joinings of J M (T, S) satisfying the entropy inequality 
is closed, and hence complete. We next show that the subset of these 
joinings that are ergodic forms a G$ subset of this set. 

Let D C C(X x Y) be a countable dense collection of continuous 
functions. The collection of £ in AAq satisfying the condition: for all 
/ G D and for all j, there exists an n such that £(Aq(/) 2 ) — £(/) 2 < 
1/j is a Gs set; clearly, the condition is that the limit inferior of the 
variances of the n-step Cesaro averages is 0. This condition is satisfied 
if and only if Aq(/) converges in measure (with respect to £) to a 
constant for each / G D, which holds if and only if £ is ergodic. 



Proof. Proposition [T6l follows easily from [19| Theorem 2.8]. 



□ 



M := {£ G J M (T, S) : £ is ergodic and h w *^(S) > h^T)} 




k=m 
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A Gs subset of a complete metric space is a Polish space by a theorem 
of Alexandrov [Mj Theorem 2.2.1]; and the Baire category theorem tells 
us that every Polish space is a Baire space [5H Theorem 2.5.5]. □ 

Proposition 18. Let (X, fi, T) be a non-trivial ergodic subshift with 
invariant measure \x, with its natural generating partition V. Let S 
be a self-homeomorphism with almost weak specification on a compact 
metric space Y that satisfies the small boundary condition witnessed 
by a sequence of refining partitions (Qe), where diam(Q^) < l/£. Let 
hfj,(T) < h top {S). For each £,n > 1, let E™> e be the set of elements £ of 
Mq (defined in Lemma\Tl\j satisfying the following two conditions: 

(i) Tx ® a{Qt) C {\/ l& T- l V) ® T Y mod £; 

l/n 

(it) a(V) ® T Y C T x <8> \l i& T^Q, mod f . 
LetE; = [j^ n Ef. 

(I) The set E™' is a relatively open subset of M.q. 
(II) If '£ G f) n>1 E™, then £ is an embedding. 
(Ill) The set E™ is a dense subset of A4 . 

Let us make a few remarks. First, by (JTT]), we may call E™ the 
collection of l/n- approximate embeddings of (X, T, /i) into (Y, S). 
Second, the small boundary condition is used to prove (111), and almost 
weak specification is used to prove the density condition f illip . Finally, 
most of the hard work will done in verifying fillip . 

Before, we prove the easier parts of Proposition [181 let us P u t to- 
gether a proof of Theorem [71 We need one more lemma in order to 
obtain fully supported measures on Y. 

Lemma 19. Let (X, /x, T) be a non-trivial ergodic subshift with invari- 
ant measure fi, and let S be a self-homeomorphism of a compact metric 
space Y with almost weak specification. Let h^{T) < h top (S). Let Ai' Q 
be the set of all £ G A^o such that vr|(£) is fully supported on Y . Then 
A4' is an intersection of countably many relatively open dense subsets 
ofM . 

We give the proof of Lemma dH] in Section [51 

Proof of Theorem [7| By Lemma [T51 we may assume that (X, T, fi) is 
a non-trivial ergodic subshift. By Lemma [T7J M.q is a Baire space, 
and by Proposition [TBI (JTJ and fillip , we have that £ := f) n>1 E™ is a 
(non-empty) dense subset of A4 ; furthermore, by Lemma [T§1 A4' D£ 
is also a (non-empty) dense subset of A^o- It follows from Proposition 
[TBI (ITl|) that the joinings of £ are also embeddings. □ 
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Proof of Proposition HE (111). If £ G then each element, X x A with 
A G cr(Q^) agrees with an element B xY with B G \J ieIj T ~ %r P up to a 
symmetric difference of measure strictly less than 1/n; moreover, since 
V JeZ T~ i P is the limit of V|i|<m T ~^ there exists B e V^- m T ~^ 
for some finite m with 

(((IxA)A(BxK))<l/n. (2) 

Let mo be the maximum of these m's over the finite set of A's in a(Qe). 

For all £' G Mo we have £'(X x OA) = 0, since 7r|(0 is an S'-invariant 
measure and by the small boundary property, we have 7r|(£')(cM) = 0. 
Since the map v i— > v{A) is continuous at all points where v(dA) = 0, 
inequality ([2]) persists for all sufficiently small perturbations of £. The 
openness of the second condition is proved similarly (using the fact 
that each element of V|i|<fc S~ l Qe has boundary of measure for each 
k and i and every invariant measure). □ 

Proof of Proposition [73 (JTTJ) . The proof follows from Proposition [161 

□ 

The remainder of this paper is dedicated to proving Proposition [T8l 
fllllj) and Lemma PT9l We choose a £ G Aio and a weak*-neighbourhood 
of £. We show the denseness in several stages. Firstly it will be conve- 
nient to assume that h n *^(S) strictly exceeds h^{T) (it is already at 
least h^T) by assumption). In the next section, we show the strict en- 
tropy gap may be assumed without loss of generality. We show that we 
can make an arbitrarily small perturbation to £ (in the weak* topol- 
ogy) so that the entropy condition is satisfied. (Similar techniques 
will also be used to prove Lemma fT9l) Secondly, using the fact that 
h^^S) > hfj,(T), we build a mapping from blocks of X into sepa- 
rated orbit segments in Y. In a third stage, we put this together with 
some marker blocks in Y, verify that the weak*-closeness is satisfied, 
the entropy condition still holds and show that the almost embedding 
property is satisfied. 

5. Upping the entropy 

Lemma 20 (Perturbation to increase marginal entropy). Let (X, /j,, T) 
be a non-trivial ergodic subshift with invariant measure fi. Let S be a 
self-homeomorphism with almost weak specification of a compact metric 
space Y . Let £ G Mo have the property that h^T) < h t0 p{S) and let V 
be a weak* -neighbourhood of t; in Mo- Then there exists a £' G V such 
that h w *^)(S) > h^(T). 
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To prove Lemma [201 we alter the measure £ by the following proce- 
dure. Given an ergodic measure A on Y with h\(S) > h^T), a ^-generic 
point (x, y 1 ), and a A-generic point y 2 that is chosen independently of 
(x,y l ), we produce another point (x,y 3 ), where y 3 is obtained from y l 
by splicing segments of y 1 with segments of y 2 . This is made possible 
by almost weak specification of S, and a careful choice of parameters 
which tune a perturbation of £ so that we have an entropy increase, 
but remain within a prescribed weak* neighborhood. The proof may 
appear somewhat technical because an arbitrary generating partition 
for Y may not have any relation to the specification properties of Y, 
and we also need to construct the perturbation so that we can verify 
the required properties. 

We define the d-bar distance between two processes in the following 
way. By a joining of two probability measure spaces each endowed 
with a transformation, we mean a probability measure on the product 
space that is invariant with respect to the product transformation and 
has coordinate projections equal to the original measures. Let (X, T) 
be the full-shift on a finite number of symbols with invariant measures 
fii and fi 2 . Let 

d(/ii,yu 2 ) := inf / l[x ^ x' ]d£(x, x'), (3) 
£eJ(/ii,M2) J 

where J(//i,/i 2 ) denotes the set joinings of (X, /J>i,T) and (X, /i 2 ,T). 

More generally, let T be a measure-preserving automorphism (X, /i) 
with a measurable partition V = {B\, . . . , B n }. For each x G X, let 
V(x) G V denotes the part that contains x. The V-name is the 
function V : X — > {l,...,n} Z given by V[x)i := V(T~ l x). Thus 
fi-p := fiop" 1 is a shift-invariant measure on the full-shift of n symbols. 
Thus we may define the d-bar distance between two invertible measure- 
preserving transformations (X, fj,, T) and (Z, u, U) endowed with finite 
measurable partitions V and Q with n and m elements, respectively. 
We consider \x-p and Uq to be shift-invariant measures on the full-shift 
of k = max(m, n) symbols. Set 

d(ji,V;v,Q):=d(jPp,u Q ). (4) 

We define H : (0, 1) R via 

%{rj) := —7]\ogr] — (1 — 77) log(l — rj). (5) 

Lemma 21. Let (X,fi,T) and (Z,u,U) be two invertible ergodic mea- 
sure preserving transformations endowed with measurable partitions V 
and Q. Let r]>0. Ifd(n,V]v, Q) < V and \V\ > \ Q\, then 

\h^T,V) - h u (U, Q)\ < U(v) +Vfog |P|. 
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Proof. See Rudolph's book [IE], Theorem 7.9] 



□ 



Proof of Lemma \2(k If h^^S) > h^(T), then we are done, so we 
assume that h n *^(S) = h^T) in what follows. 

Choose an ergodic measure A on Y satisfying h\(S) > h^^^S) = 
h^(T). Choose e G (0, 1) so that V contains the ball B(£,3e) PI .Mo- 
Let S = l(h\{S) - K*(£)(S)). 

By a variation of a standard argument (see for example [5l Lemma 
3.2]), we may choose a partition V = {Bi,...,B n } of Y such that 
7r!(O(0P) = KdV) = and h x (S,V) > h x {S) - e5 and K m (S,V) > 
h n *(£)(S) — e8. Let H be as in <^ and choose 7 < e so that 7/(27) + 
2^ log( (T^l + 1) < e5 and 2^h top (S) < eS. Let r < e be chosen so that 
*2i€)(drV) < 7 and X(d r V) < 7. 

We now build a mixing process taking values in R = {0,1, 2} z . 
Choose N sufficiently large so that the gap function satisfies 

L r (N)<k := L(7/2)iVj; 



in addition, we require that 

1 ; T > 1 - £ - 7 and - ; — > e — 7, 

«o + «i + ^2 fc + + K2 

where fci = [(1 — e — •y/2)N\ and &2 = [_( e ~~ 7/2)^Vj- The process 
consists of concatenations of the blocks i fc io fco 2 fc2 fc ° and i fc i+ 1 o fc °2 fc2 O fc °, 
where the blocks are placed independently with equal probabilities of 
using the longer and the shorter block. (We need to use values k\ and 
k\ + 1 in order to make ( mixing.) Write ( for the measure on R. 

It will be convenient to have notation that will allow us to distinguish 
various copies of Y in certain product spaces. Let KW = Y for i = 
1,2,3. Let a be the shift on R and consider the transformation r = 
a x (T x S) x S acting on = R x (X x FM) x Y^ 2 \ preserving the 
(not necessarily ergodic) measure ( x £ x A. Define a new partition Q 
on the product by A = {(2, y {1 \ y {2) ) ■ z G {1,2} and y { Q Zo) G A} and 
D = {(z,y (1) ,y (2) ): z = 0}. Note that \V\ + 1 = |Q|. The idea is 
knowing which element of Q a point (z,y^\y^) lies in tells us which 
element of P the zo'th copy of y lives in if zq is 1 or 2; or else tells us 
that 2o = 0. Let 1Z denote the a- algebra of fl giving information about 
the R coordinates. 
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Using the fact that y^ and y^ are chosen independently from the 
measures 7r|(£) and A implies that 



1 ( M ~ X \ 1 f M ^ 

lim — H^ xX V/ t-'Q > lim —H ( ^ xX V/ r^Q 

V i=0 / \ i=0 



K 



> (1 - e - t)^ 5 (0(^ ^) + ( e " t)/ia(^, 
By affmeness of entropy and the ergodic decomposition theorem, we 
can take an ergodic component, i, of ( x £ x A such that 

Mr, Q) > (1 - e - 7)^(0(5, 7>) + (c - 7)^a(5, P). (6) 

Since £ is mixing, both £ x Tc^iC,) and ( x A are ergodic. Thus the 
projections of i on R x Y"W and R x F( 2 ) are (still) £ x 7r£(£) and ( x A, 
respectively. 

By induction and the definition of almost weak specification, for an 
t-generic point u> = (z, (x, y^), y^) G fi, we can build a y such that 

d(S n y, S n y M ) < r for all n such that z n G {1, 2}. (7) 

Thus by standard arguments (see the proof of the Krylov-Bogolyubov 
theorem [22, Page 97]) and j7]), there exists a (r x S^-invariant measure 
I on f2' := Q x Y"( 3 ) such that for Z-a.e. point (2, (x,y^),y^ 2 \y^), we 
have 

d(SV 3) , 5 n y( Zw) ) < r for all n such that z n G {1,2}. (8) 

Furthermore, by taking an ergodic component i' of t we may obtain an 
ergodic measure such that (jHJ) holds for t'-a.e. point. Since 1 is ergodic 
and the original point u> was chosen to be generic, the projections of l' 
and I on Q are l. 

We now define tt(z, (x,y^), y ^ 2 \y^ 3 ') = (x,y^). Note that by con- 
struction, y( 3 ' is within r of whenever zq — 1. Let £' = -zf* (^')- 
Clearly, £' is ergodic. 

We now give estimates on the weak* distance between £ and £' and 
on the entropy h^^S). Let / G Lip : (X x F) (recalling that / takes 
values in [0, 1] by definition). Extend / to functions f l , f 3 on fl ; by set- 
ting f\z,{x, y W),yV\yW) = and / 3 (z, (x, y«), y( 2 \ y®) = 

f(x,y {3) ). Also let 

D:={(z, (x,y^),y^,y^)en':z = l}. 

We then have 
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Thus by (JSJ), we have 

d* (£,£')< r + + 7) <3e. (9) 

As for entropy, we estimate it by using ([6]) and the d distance between 
P-name of y^ and the Q-name of (z, (x,y^),y^ 2 \y^ 3 '). 

For u' G fi', if Zo = 1 and ^ c^-P, then t/ 1 ) and y^ lie in the same 
element of V. Similarly if z = 2 and y^ $ d r V, then ?/ 2 ) and y^ lie in 
the same element of V. In particular, if z = 1 and $ d r V or z = 2 
and ^ <9 r "P, then y( 3 ) lies in Bj if and only if (z, (x,y^),y^ 2 \y^) 
lies in Dj. 

On the other hand, since the projection of l' on Q is t and the 
projections of 1 on R x and -R x are £ X ^(O an d C x ^> 
we have 

</{u/ G fi': y (1) G <9 r P and 2 = 1} = 712(f) W) ■ ({z: z = 1}; and 
G fi': y (2) G <9 r P and ^ = 2} = X(d r V) ■ ({z: z = 2}. 

Note that 712(f) is the projection of t' on K®. Let P' be the partition 
of fi' whose parts are given by 

{u/ G Of : y (3) G 5;} for 1 < i < n. 

Let Q! be the partition of Q! whose parts are given by 

{u/ G n' : (z, (x,y (1) ),y (2) ) G A} for < 2 < n. 

It follows that the d-bar distance d(7r|(f'), T 7 ' ; t', Q') is bounded above 
by 

^(0(W-C{^: ^0 = l} + \(d r V)-({z: z = 2} + ({z:z = 0}< 2 7 . 
By Lemma I2"TI we have 

1^(^(5, P) - h,(r x 5, Q)| < %{2i) + 2 7 log(|P| + 1) < e& 
Thus by (jBj) and the fact that /i t (r, Q) = h L i(r x 5*, Q'), we have 

>K m (S,V) 

> (1 - e - 7)^«)(^ ^) + (e - 7)^(5', P) - e5 

> (1 - e)^ (0 (5) " ^ + + 45) - e5 - 2 1 h t0V {S) - e5 

> Kuq(S). □ 

The proof of Lemma [T9l uses similar ideas, but is made easier by the 
fact that we can use Lemma I 
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Proof of LemmaUR Let {V n ) ne ^ be a countable collection of open sets 
forming a neighbourhood basis for Y. Let C n = {£ G A4 : 7r|(0(Ki) > 
0}. By standard properties of the weak*-distance, C n is a relatively 
open subset of Alo in the weak* topology. It suffices to show that C n 
is dense in Afo for each n. 

Fix n G N and e > 0. Let £o G Afo- By Lemma [20], there exists 
£l G Mo such that K^^S) > h^T) and d*(£ ,£i) < e / 2 - Let Q be 
a partition such that ^2(^1) (dQ) — and h n *^(S, Q) > h^(T). Let 
5 < e/4 be such that the following inequality holds: 

ft(25) + 25 log | Q| < ^(^(S, Q) - h^(T), 

where Tl is defined in (JSJ). 

Let r < e/4 be chosen such that 7T$(Ci)(^rQ) < 8 and B(y,r) C K 
for some fixed y G Y". We then choose N such that L r (N) < M : = 
L<5iV"/2j. Let C be a mixing measure on Z = {0,1, 2} z supported on 
concatenations of the blocks 1 N ~ 2M M 2 M and 1 N +^ 2M M 2 M . As 
in the proof of Lemma [2171 we start with the (ergodic) product measure 
£i x ( on (X xY)xZ and obtain an ergodic measure i on (X x F) x 
ZxF whose marginal on the first pair of coordinates is £i and which 
satisfies d(y',y) < r for t-almost every ((x,y), z,y') such that zq — 1 
and d(y',yo) < r for t-almost every ((x,y), z,y') such that zo = 2. 
Projecting 6 onto the initial X coordinate and the final Y coordinate 
gives a measure £' that can be checked using the same arguments as in 
Lemma M to satisfy d* < e/2 and d(7r*(^), Q; 7r 2 *(£'), Q) < 25. 
This is sufficient to ensure that h^^/^S, Q) > h^(T), so that £' G Afo- 
Finally, 7r 2 *(f')(#(y, r)) > 1/(JV + 1), so that f G C n . □ 

6. Marriage via Brin-Katok 

Let (X, T) be a non-trivial ergodic subshift with invariant measure 
li. Let S be a self-homeomorphism on a compact metric space Y . Given 
a yU-joining, £, with h v *^){S) > h^(T), we will define an injective map 
from blocks of A to orbit segments of Y, with certain properties that 
will be useful in constructing perturbations of £ that are approximate 
embeddings. 

This section, while in spirit closely following the proof of the Krieger 
generator theorem given by Burton, Keane and Serafin [TO], differs in 
the details because we embed into a general compact metric space, 
rather than into a shift space. Burton, Keane and Serafin make use 
of the Shannon-Macmillan-Breiman theorem and Hall's marriage the- 
orem, which are also important ingredients in the proofs of the Sinai 
factor theorem and Ornstein isomorphism theorem given by Keane and 
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Smorodinsky [28j EH]. We substitute the Shannon-Macmillan-Breiman 
theorem with a topological analogue due to Brin and Katok [§]. In 
order to define markers in a general setting that is not necessarily sym- 
bolic, we will also make use of a generalization due to Downarowicz 
and Weiss [19] of the Ornstein and Weiss [43J return time formula for 
entropy 

Let S be a self-homeomorphism on a compact metric space (Y,d). 
For integers m < n let 

d^(y,z) = max d(S J y, S 3 z) for all y, z E Y. 

m<j<n 

We define the (d^, S)-Bowen ball about a point y E Y by 

Bl(y,5) := {zeY:dl(y,z)<5}. 
For an S-invariant measure A on Y we define for (z, rj) EY x M + 
^ K (^r ? )=liminf-(l/n)logA( J B n (^,r / )) 

n— >oo 

and 

h™( z , V ) =liminf(l/n)logmin{i > 0: ^(z^z) < r?}. 

n— >oo 

It can be easily checked that h% (z,T]) and h^iz, 1 !]) are ^-invariant 
functions of z, so that if A is an ergodic measure, there are mono- 
tone functions h^ K (rj) and h x (i]) such that /^ K (z, 7/) = /^ K (f?) and 
h^ N {z,rj) = /i° w (?7) for A-almost every z E Y. 

Theorem 22 (Brin and Katok [8]). Let S be a self-homeomorphism of 
a compact metric space Y preserving an ergodic measure A and suppose 
that h\(S) < oo. Then for X-almost every z EY , we have 

]imh* K (z,7 1 ) = h x (S). 

Theorem 23 (Downarowicz and Weiss [19]). Suppose that S is a self- 
homeomorphism of a compact metric space Y preserving an ergodic 
measure A and suppose that h\(S) < oo. Then for X-a.e. z E Y , we 
have 

lim h DW (z,r]) = h x (S). 

Lemma 24 (Marker Lemma). Let S be a self-homeomorphism of a 
compact metric space Y preserving an ergodic measure v with < 
h u (S) < oo. Let a E (0,1) and let r be such that h^ K (2r) > and 
h™(2r) > 0. 

Then for all sufficiently large integers M, there is a point y ma rk G Y 
with the following properties: 

(1) df '(SVark, S eM y^ rk ) > 2r for i E {0, . . . , 6M - 1}; 
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(2) v(Hi) < a/M where E x := 5 2M (y mark , 2r). 
^ z/(# 2 ) < a/M where H 2 := B^^y^, 2r). 

Proof. Property (JlJ) comes from applying Theorem [23] to S 1-1 . Proper- 
ties © and (ED follow from Theorem EI □ 

Corollary 25 (Marker decipherability). Lei K, S , M, r, Hi and H 2 

be as in the statement of Lemma \2~4\ Let z G Y be a point such that 

(i) S l z Hi for <i < L; 
(ii) S*z ^H 2 forO<i<L. 
Also suppose that S satisfies almost weak specification. Let L > 
and assume that the gap function L r (L) < M. Let z G Y be a point 
satisfying the specification conditions: 

(a) d L A f 10M (~z,z)<r 

(b) d 8 M (S L -» M ~z,y mark )<r. 

such a point exists by almost weak specification. Then z satisfies 

d 8 Q M (S l z, y mark ) > r for < % < L - 9M. 

In particular, ifO<j<L — 9M, then j may be recovered from S^z by 
j = (L- 9M) - mm{i > 0: d 8 M (S l z, y mark ) < r} 

Proof. We will check three cases. 
If < i < L - 18M, then we have 

d 8 M (S%y maik ) > d 2 M (S^ M z,S eM y maTk ) 

> dl M (S i+6M z, S 6M y mark ) - r (by (jaj)) 

> 2r — r = r (by (jD}). 

If M < i < L - 11M, then we have 

d« M (^z,y malk ) >d 2 M (S%y malk ) 

>d 2 M (S l z,y mavk )-r (by (jaj)) 
> 2r — r = r (by (jlj)). 
If L - 11 M <i < L- 9M, then we have 
d 8 M (S%y malk ) > dl M (S^ M ~z,S eM y mark ) > 

i2M/ Q j+15M-L„, Q6Af„, \ i2M ( qi+15M-L qi+6M ~\ 

U^O \° i/mark) *J i/markj Uq \° i/mark; *J ^) 

> 2r - r = r (by Lemma El flU ) and ©. □ 

The following lemma involves the choice of many constants (which 
are necessary for our proof of Proposition [TBI ( ILTTj) ) . but it follows easily 
from Theorem [22] and the ergodic theorem. We will refer to Bowen balls 
in the product space X x Y, where X is a non-trivial subshift, and Y 
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is a compact metric space endowed with a self-homeomorphism. For 
the definition of these, we will take the metric to be 

dxxY((x,y), (x',y')) = m&x(dx(x,x'),d Y (y,y')). 

Lemma 26. Let (X, /i, T) be a non-trivial ergodic subshift with in- 
variant measure // and natural generating partition V. Let S be a 
self-homeomorphism with almost weak specification of a compact met- 
ric space Y that satisfies the small boundary condition witnessed by a 
sequence of refining partitions (Qe), where diam(Q^) <!/£. Let £ be a 
li- joining with h^^S) > h^iT). Let e > 0. 
Set 

A:={K si& (S)-h^T))/10. (10) 
Let < 5 < e/80 satisfy the inequalities: 

5(1 + h i (T x S)) < A; (11) 
165(1 + log \V\) < eA; (12) 
4(1 -5)(1 ~ 155) > 3. (13) 

By Theorem UM let rj < e/12 be chosen so that 

h5m(2n,S) > K m (S)-5and (14) 
f(2f),TxS) > h i {TxS)-5. (15) 

Let £ > IO/77 50 that diam(Q£) < 77/IO. Let r < 77/10 satisfy the 
condition of Lemma WJ\ and be such that 



*m(drQt)<5. (16) 

Set 

M:=[5N/11\. (17) 
Then then following conditions hold for all sufficiently large N: 

(a) e NA > 2; 

(b) 2- M < r; 

(c) -k<$; 

(d) the gap function satisfies L r (N) < M; 

(e) Lemma \24\ holds with a = 5 2 /22 and the value of M as defined in 



m 

(1) fi(Si t N) > I — 5 where 

S ljN = {xeX: h(B»(x,t,)) > e -(MT) + A)iv } . 
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(2) 7r|(0(52,jv) >l-S where 

S 2 , N = {yeY: nm(B%- l0M (y,2 V )) < e^^"^}; 

(3) £{S 3:N ) >1-S where 

Ss,n = {(x,y) eXxY: t(B»-™ M (x,y),2r,) < e -W*S)-A)s } . 

(4) ^(S^n) > 1 — 5 where 

S 4 , N = {(x,y) eXxY: t{B»((x,y),T,)) > e -(MTxS) + A)iV } . 

(5) C(^5,n) > 1 — ^ where 

S 5 , N = {(x,y): \A»T WM f(x,y)-£(f)\ < j- 2 for all f e Li Pl (X xY)}; 

(6) k${£){S & , n ) > 1 - 5 where 

S 6 , N = {yeY: A»- l0M l drQe (y) < 5}. 

Note that in Lemma 126], our choices of constants give: 

120r, 80S, I2r] < e. (18) 

Proof of Lemma \2h\ Conditions (jSj), (jbj) and (p) are trivial. By Lemma 
I2U for all sufficiently large M (hence for sufficiently large N), the 
conditions are satisfied establishing (jej). Condition (jd]) holds for large 
iV as a consequence of the definition of almost weak specification. 

Conditions ([1]) and @ follow for large N from the upper estimate in 
Theorem [22] 

For 02]), we note that 

irm(B%r 10M (y,2v)) = ^){Bt nM iv^))- 
Hence for ^(^-almost every y, we have 

liminf - (1/AO logTrSCOC^-^Ci/, 2^)) = (1 - 5)h*f {0 (2 V ) 

> (1 - 5)(h„ m (S) - 5) (by (UU)) 
>K m (S)-6(i + K m (S)) 

> K m (S) - A (by (P)). 

A similar argument shows ([3]) (using (Tl5]) ) holds for large iV. 

That condition ([6]) holds for large N, follows from the Birkhoff er- 
godic theorem and (fTBT) . That condition (jSJ) holds for large AT, would 
follow immediately from the Birkhoff ergodic theorem, if Lip x were a 
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finite set. Recall that by Lip x , we mean those functions with Lipschitz 
constant 1 taking values in [0, 1]. By the Arzela-Ascoli theorem, Lip 1 is 
totally bounded with respect to the uniform norm on continuous func- 
tions. Hence there exists a finite collection F C Lip x that is e/24-dense 
in Lip x with respect to the uniform norm. Now set 

S 5 >,N = {(x,y): \AZ- WM f(x,y)\ < e/24 for all / G F}. 

An application of the triangle inequality shows that S^n D Ss',i\r- No- 
tice that since M = cN, A%f = N/(N-M)-(l/N)(NAg f-cNA c N f), 
so that for a fixed / G Lip 1 (A x 7), we have A^f(x) — > £(/) £-a.e. 
Since F is finite, we have that £(S5',iv) — ^ 1 as N — > oo by the Birkhoff 
ergodic theorem. □ 

Lemma !2"o1 and the following variant of Hall's marriage theorem will 
be used to define the injective map from blocks of X to orbit segments 
of Y. Given a relation R C B x G, we let R(b, •) := {g G G : (b, g) G R} 
and R(; g) := {b G B : {b, g) G R}. 

Theorem 27 (Hall's marriage theorem [23]). Let B and G be finite 
sets. Let R C B x G be a relation with the property that there exists 
K > such that for all b G B we have \R(b, -)| > K, and for all g G G 
we have \R(-,g)\ < K. Then there exists an injection <fi : B — > G such 
that as a relation <fi C R C B x G. 

A proof of Theorem [27] can be found in Downarowicz's book [TTJ 
Appendix A] . In Theorem |27l the sets B and G are often referred to as 
boys and girls, respectively; we call the integer K a marriage bound 
for R, and the map <p a dictionary. 

We say that a subset A C Y is (d^, rj) -separated if d^(a, a') > 77 
for each distinct pair a, a' in A; and is said to be a (d^, 77) -spanning 
subset of i? C Z if A C B and for each 6 G 5, there exists an a G A 
with d^(a, 6) < 77. Notice that a maximal (d™ , -^-separated subset of 
B is necessarily (d^, ^-spanning. 

Given a finite word, B = bo . . . frjv-i) with symbols in the alphabet 
of A, we write [5] for the cylinder set {x G X : x = b , . . . ,Xn-i = 
b N -i}- 

Corollary 28 (Corollary to Lemma 126]) . Fix N so that conditions of 
Lemma hold, with a point y ma rk G Y satisfying (jej) . Let Hi and H 2 
be defined as in LemmaWA Let 



A := {y : T j y H x U H 2 for 0<j< N}. 
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Let G be a maximal (d M ~ 10M , 77) -separated subset of A fl S 2 ,n H S^iv- 
Lei [/clx7 6e given 6?/ 

c/ = (x x (5 2i7V n 5 6iiV n A)) n (s^ x y) n s 3 ,iv n s 4 ,tv n s 5 ,jv. 

Lei B 6e the set of elements B eV N such that /i([B]) > e-^M^+A) 
and£(([B]xY)nU)>±ii{[B]). 
Define 

R= {(B,y) G B x G: 3(u,v) G U with u G [£], d^ 10M (y,7j) < 77} . 

if = l e A r (^(TxS)-/i fl (T)-2A) > 

T/ien 

^ f (C/) > 1 - 75; 

(B) |G| > I e ^I«)^)-A). 

(CJ |B| < e*(MT)+A) and /i (UW^) > 1 " 155 ! 
(7}) K is a marriage bound for R; 

(E) there exists an injection (f) : B — > G such that cf) C -R; 
fFj ifBeB, then for all x G [£], y G B%f 10M (cj)(B) , r) , and for all 
f G Lip^X x Y), we have 

\<f 10M f(x,y)-m\<f2+v + r. 

Proof. (A) Since £ is a joining, from Lemma [261 (j2J), an d (jej), we 
see that £(X x (S^jy H S^iv fl A)) > 1 — 35 and from Lemma [261 
CP) £0Si,jv x y) > 1 - 5 so that Condition (JA]) holds. 

(B) By the definition of G we have, 

|J B»- l0M {y,r]) D A n S 2 , N n 5 6 ,^. 

By Lemma ES © we have for each y G G, 7r^(£)(£^~ 10M (y, 77)) < 
e-*M«)- A ). We deduce |G| > l e N ^^~ A) . 

(C) The first claim follows from the definition of B. For the second, 
let Bi = {B G V N : fi([B]) > e ^ h ^ +A ^ N }. For x G [B], we 
have [B] = B{x,N 1 \) D B(x,N,rj). By Lemma [26] © , we have 
/i(|J BgBi [£?]) > 1 - 5. We then have 

75>au c )>t( |J ([fi] x y) rw c ) > £ ^([B]). 

BeBi\B ' BeBi\B 

This yields fi(^{J BeB [B]J > 1 — 155 as required. 

(D) Let B G B and let R(B, •) = {y u . . . , y n }. We will show that n > 
K. Since (B,yj) G R, it follows there exist (uj,Vj) G Z7 such that 
Uj G [.B] and d^- 10M (^,%) < 77. Let 5, = B*f WM (( Uj , Vj ), 2 V ). 
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Since (u^Vj) G U, by Lemma M © , < e -(M T *s)-A)iv We 

claim that the Sj cover ([B] xY)C\U. To see this, let (u, v) & U sat- 
isfy 116 [B]. Since (u, t>) G U, we have t> G AhS^n HS^jv- Hence, 
by the definition of G, there exists y G G with d^~ 10M (?/, t> ) < 77, 
so that (B,y) G i?; hence y = yj for some j. It follows that 
d^- 10M (v,^) < 2 V . Since u, Ui G [B], we have d^" 10 ^,^-) < 
2~ M < r] so that d^~ 10M ((u, u), Vj)) < 2r] and (>,t>) G S,- as 
required. Since B G B, we have £(([£] xK)nE/)> \Ki B \) > 
I e -(V(^)+A)Ar_ g ince x y) n C/ is covered by the S/s, we see 
that n > ^(h 6 (Txs)-h,(T)-2A) = K ag required . 

For the other half of the argument, let y G G and let R{-,y) = 
{Bi, . . . , B m }. Pick witnesses (ui,Vi) G U so that Uj G [Bj\ and 
d^f 10M (vi, y) < rj. Let = Bq ((ui,v-i),rj). These sets are all 
contained in X x B*f 10M (y, 2rj), which, since y G G, has measure 
at most e -(S*(«) (5) - A)Af by Lemma [26] (EJ. Since (u;,^) G U, 
by Lemma [261 (jlj), each Di has measure at least e -( h i( TxS )+ A -) N . 
Finally, if i 7^ i', then d^(uj,Uj/) = 1 so that the Di are disjoint. 
In particular, we deduce from ([TO]) and Lemma [26] ^aj) that m < 

e JV(^(Tx5)-^* (e) (5)+2A) < R 

(E) Property ([E]) follows immediately from Property ([D|) and Theorem 
EH 

(F) Let BgB and x G [5]. Let / G Li Pl . Since (B,(f)(B)) G i?, there 
exists (xo,yo) G [/ such that x G [-B] and ?/o £ B^f 10M (<j)(B), rf). 
Thus from the definition of [/ and Lemma [261 (O, we have 



|A^ 10A ^o,yo)-£(/)|<e/12. 

If x G [-B] and y is any point in B^~ 10M (0(1?), r), then we have 
d((T i x, S'y), (T%, S^o)) < r + r] for each M < i < N — 10M, so 
that by the Lipschitz property, 

\A^f 10M f(x,y)-^f)\ <f 2 + V + r, 

as required. □ 

7. Proof of Proposition [18] (pB 

In this section, we use Corollary [25], Lemma [26] and Corollary [28] 
to build a joining. Using Lemma [201 we then prove the following re- 
statement of Proposition [18] fllllj) . where we assume a strict entropy 
gap. 

Lemma 29. Consider the setup of Lemma \2h\ Let (X,fi,T) be a non- 
trivial ergodic subshift with invariant measure fi and natural generating 
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partition V. Let S be a self-homeomorphism with almost weak specifi- 
cation on a compact metric space Y that satisfies the small boundary 
condition witnessed by a sequence of refining partitions (Qt), each hav- 
ing zero measure boundary, where dmm(Qi) < l/£. Let £ be a //- joining 
with hn*(£)(S) > hfi(T). Let e > 0. There exist £ and a n -joining, £ ; 
satisfying the following properties: 

• weak* -closeness: d*(£, £) < e; 

• approximate embedding properties: Qe C Viez T~ l P mod £; 

andV C Viez 5 "*^ mod f; 

• entropy preservation: h^^^S) > h^T). 

Proof of Proposition [731 flLTT|) . Let £ G .M . Let n > 0. Without loss 
of generality, let < s < 1/n. We need to find a £ G E™ such that 

d*(£, < £■ By Lemma [20l we may assume without loss of generality 
that h w *(£)(S) > h^(T). So the result follows from Lemma □ 

We need one more tool before we can define the joining in the proof of 
Lemma [291 We will make use of the following variation of the Rokhlin 
tower theorem. 

Lemma 30 (Rokhlin tower theorem: independent base version). Let 

(Q, J 7 , /i, T) be a non-periodic measure-preserving system. Let N be a 
positive integer and 5 > 0. For any finite measurable partition V, there 
exists F G T (the basej with the following properties. 

(1) The sets F, TF, . . . , T N ~ 1 F are pairwise disjoint. 

(2) The complement of their union E :— Q \ [jf^ 1 T l F (the error 
set) has measure exactly 5. 

(3) The a -algebras generated by F and V are independent, so that 
\i{F n P) = fJ.(F)fj,(P) for all P eV. 

For a proof, see the book of McCutcheon and Kalikow [201 Theorem 
184]. After defining the joining in the proof of Lemma I2"91 we motivate 
why it satisfies the desired properties before we proceed with technical 
calculations. 

Let (Y, d) be a compact metric space. Given two sequences of points 
u = (jji), v = (y'j) of Y, and r > 0, we say that u r -shadows v in I 
if d(yi, y[) < e for all i G I. 

Proof of Lemma \2~9c definition o/£. Choose A, 5, T), £, r, M, and N as 
in Lemma [2H1 By Lemma [301 let F be the base of a Rokhlin tower in 
X with height iV and error set E of measure 5 with the property that 
F is independent of V/ilo 1 T~ % V. Given a point x that we assume to 
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be generic for /x, we let n (x) = sup{k < 0: T k x G F U E }. We then 
let (rij(x))j E z be the enumeration of {k G Z: T fc a; G FUE } satisfying 

. . . < n„ 2 (^) < n-i(x) < n (x) < n\{x) < . . . . 

The jth block of x is then the block 

Bj\X) %nj(x)-Enj(x)+\ ■ ■ ■ •En,j+i(x)— 1 ■ 

The blocks are of length iV (for those j such that T nj ( x \x) G F) and 
1 for those points on the orbit that land in the error set Eq. 

Let 2/mark, B, G, and (j) be given by CorollaryEHl We introduce special 
symbols D, V ^ Y (with D standing for 'dictionary' and V standing 
for 'vacuous'). Let G D = G U {D} and Y v = Y U {V}. Equip G^ 
with the usual shift map a, and the Bernoulli measure ( giving each 
coordinate mass e/(2|G|) to each member of G and 1 — e/2 to D. If 
rij + i(x) — rij(x) = N, let rrij(x) := rij(x) + N — 9M and set 

Rf°(x) = [ nj (x) + M, nj{x) +N- 10M); and 
Rf* rk (x) = [ mj {x),m 3 {x) + 8M); 

otherwise, let i?j nfo (x) = i?™ ark (x) = 0. We now define a map ij) : 
X x Gf, Y$ by 

n e 5,(x) G B, z nj{x) G G; 

S^-niW^^^z)) n G Rf io {x), Bj{x) G B, z nj{x) = D; 
S n -^^ ymaik n G i?f ark (^), ^(z) G B, z n . {x) = D; 
V otherwise. 



ip(x,z) n 



The sequence ^(x, z) of points in IV is the sequence of points that 
we wish to r-shadow, while V denotes the null constraint. By condition 
([d]) of Lemma [26l the orbit segments that we are attempting to shadow 
are of length less than N and are separated by at least M > L r (N), so 
that given (x, z), a /i x (^-generic point of X x G^,, there exists y G Y 
satisfying the specification constraint given by 

d(S' n y, ?/>(x, z) n ) < r whenever -0(x, z) n ^ V, for all jigZ. 

Define a product space Q and an invertible measure-preserving trans- 
formation t via: 

Q := X x G£ x Y and r := T x tr x 5. 

As in the proof of Lemma [2171 there exists an ergodic invariant measure 
i on Q such that for t-almost all points u = (x, z, y) 

d(S n y, if)(x, z) n ) < r whenever i/j(x, y) ^ V, for all n G Z. (19) 
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Define £ to be the ergodic measure obtained by projecting l onto the 
first and last coordinates of the tuple. By the genericity assumption 
above, £ is a //-joining. □ 

It remains to verify that £ satisfies the required properties. Using 
Corollary [28] (JF]) on the blocks of length N— 11 M where we are applying 
the dictionary on B, we will obtain the weak*-closeness by Corollary 
ESQ and (JED. 

The fact that we are placing markers in between the coded boys 
will enable us to recover most of x from y, giving us the latter half 
of the approximate embedding property. To guess which element of 
Qi that a point y belongs to from x, we have no chance if B (x) does 
not belong to B. If it does, and the dictionary is applied, instead 
of using a random element of G, then we know that (S k y) shadows 
(3 k - no( ^<l>(B j {x))) in R™ io (x). Unless S^^B^x)) lies close to the 
boundary of Qg, we can deduce which element y lies in, giving us the 
other approximate embedding property. 

Finally, the entropy of S with respect to n\ (£ ) has a lower bound that 
is the sum of two contributions: one term is the entropy obtained by 
encoding most of the blocks of X, and another is given by sometimes 
using random elements of G. We do not encode blocks that are not 
members of B and because of the error set in the Rokhlin's lemma 
the contribution from encoding blocks of X is strictly less than h^(T); 
however, this entropy loss is at most 

(^(\J[B])+fi(E )) - login 

V B^B J 

For an e/2 proportion of blocks that do belong to B, we do not apply 
the dictionary (ft, but instead use a random element of G; we will see 
that this results in a net gain of entropy that is large enough to cover 
the losses incurred by encoding only members of B and by not being 
able to encode anything on the set E . 

Proof of Lemma W9c weak* -closeness. Let P G \/^ =i ^ T ~ l V, where V is 
the natural generating partition for X. By Lemma [301 y>(P H F) — 
fi(P)fi(F). Since T preserves the measure \i and for all < k < N — 1 
[x E X : T~ k x e F] = {x G X : x E , n (x) = —k}, we have 

fi {x G X : T no{x) x G P, x E , n (x) = -k} = fi(P)fi(F) (20) 
for all < k < N — 1 and hence 



fi{xeX : T noix) x eP:x^E }= fi(P)(l - 5). 



(21) 
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Define the 'bad set' by 



BSi 


= {{x, 


z, y) G fi : x 


G£ } 


BS 2 


= {(*, 


z, y) G f2 : x 


^ #0, ^noO) ■ • • Xno(x)+N-l B} 


BS 3 


= {(*, 


z,y) G : x 




BS 4 


= {(*. 


z, y) G f2 : x 


^E ,O^Rf°(x)}- 


BS 


= BSi 


U BS 2 U BS 3 


U BS 4 . 



We have that t(BSi) = fi(E ) = 5. By (ED and Corollary M (0) , 

we get 

t(BS 2 ) = (1 - <fy/ |J [sA < (1-5)155. 

^ BeV N \B ' 

By independence of ( and /x, we have t(BS 3 ) = (1 — S)(e/2). Using the 
fact that the length of the interval i?Q nfo (x) is iV — 11M when x G" E , 
by our choice of M = [5N/11\ in (IT7|) . we have t(BS 4 ) < 5. Thus by 
(ITS]) , we have 

t(BS) < 175 + e/2 < 3e/4. (22) 

Let / G Lip x (X x Y). For < fe < N, let A fc = {(x,z,y): x G 
BSJ n BSg n BS3, no (a:) = —k}. Notice that 

BS C = BS^ n BS^ n BS^ n BS^ 

iV-10Af-l 

= BS\ n bs 2 n BS3 n (J {(x, 2, y) g n ■. n (x) = — fe} 



fc=M 



AT-10M-1 



= u ^- 

k=M 

So we have 

/(x,y) d£{x,y) = 



f(x,y) di{x,z,y) 

JV-10M-1 



/ f{x,y)dL(x,z,y)+ / f(x,y)di( 

Jbs k=M JA k 



x,z,y) 



BS 



/(x, y) *, y) + (N - 11M) / A^ 10M /(x, y) di{x, z, y). 



A 



By Corollary [281 flF]) , for (x,z,y) G A), we have |A 



N-WM 
M 



f(x,y) 



< e/12 + r] + r. We also have (N — llM)fi(A ) = /i(BS c ). Hence 
we by (1221 and (ITS]) , we have 

l£(/) - £(/) I < MBS) + e/12 + V + r<e. □ 



x{x) 
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Proof of Lemma \29i approximate embedding properties. We first show 

that Qe C \] i( zj j T % V mod £. Enumerate Qi as {D\, . . . ,D n } and re- 
gard Qe as a map from Y to {1, . . . ,n} where Qe(y) — j if y G Dj. 
Define 

(s o Mx) <j){Bo{x)) if it's defined; 
\vmaA otherwise. 
We define Q' from X to {1, . . . , n} by 

Q'(x) = Qe(x(x)). 
Since Q' is c(\J i€Z T J P)-measurable, it suffices to show that 
t({(x,z,y): Q e (y)^Q'(x)})<e. 

Let 

BS 5 :={(x,z,y) £ BS: X (x) e d r Q t }. 
Notice that by definition of l, we have 

{(x, z, y) : Q £ (y) ^ Q'{x)} C BS U BS 5 mod l, 

By the definition of the set G and 0, we have that <p{B) G Se,N for all 
B G B. Thus by Lemma [26] © we have A^f 10M l BS5 {x, z,y) < 5 on 
A . We compute as in the weak*-closeness section. 

JV-lQAf-l 

t(BS 5 )= ^(BS 5 nA fc ) 

k=M 

= (N-UM) [ A^- 10M l BS5 (x,z,y)dt(x,z,y)<6. 
Ja 



By J22D and QSJ, we have 

y) : ^ Q'(a;)} < t (BS U BS 5 ) < fe + 5 < e. 

For the approximate embedding in the opposite direction, we use the 
markers and the invertibility of <p to argue similarly. We let 

BS 6 = {(x, z, y) G n : n {x) < -N + 9M}. 

We see t(BS 6 ) < 9M/N < 5. Define 

n (y) = min (min{A; > 0: S k (y) G B 8 M -\y mark ,r)} - (N - 9M),o). 

Equip B with an arbitrary total order and define a map b: Y — > B by 
%) = argmind^- 1OM (y,0(5)), 

breaking ties lexicographically if necessary. Finally, set 

p(y) = b(S^y\y)U o{y) . 
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For L-a.e. (x,z,y) G fl \ (BS U BSg), we have n (y) = no(x) by the 
definition of t and Corollary 121)1 Since G is a (d^, ?/)-separated set, 
by (fl9l) and choice of parameters in Lemma 1261 d^~ 10M (y,(f)(B (x)) < 
r < 77/IO. Since <f> is one-to-one, there is only one girl, namely 4>(Bq(x)) 
within 77/5 of the corresponding block of y; thus, b(y) = B (x). 
Hence we see that 

l({(x, y) : V{x) ± V(y)}) < l(BS U BS 6 ) < 195 < e. □ 

Proof of Lemma \2§c entropy preservation. We define a number of par- 
titions of Q that we shall need in order to do calculations. Regard the 
partition Qe of Y as a partition of Q. We will express a typical point 
of lo G Q as uj = (2, z, y). Define 

Ci := {uj G O: x g E ,B (x) G B and z nQ{x} = D}, 

and 

C 2 := {u G tt: x g E , B (x) G B and z noix) ^ D}. 
Let C := Q \ (d U C 2 ), and C := {C , C x , C 2 }. Set 




Let 

Q\ 2 := {QnC c :QeQe}u{C }. 
For each j G {1,2}, let 

Q\:= {QnQ:QeQ<}u{fl\C j }. 

Let 

gj> 2 ;= {{ w G ^(x,z) G Q}: Q G Q,} U {C }. 
For each j G {1,2}, let 

Q{ := {{to G Q: ip{x,z) G Q}nC,-: Q G Q £ } U {fi\C7,-}. 

Let yfj~M° M1 S~' l Qi = {Ai, . . . , Al}. Let Q[ BL ^' 2 be the partition 
given by {yl[ BIj ' 1 ' 2 ) A^ 1,2 , . . . , x ' 2 }, where 

Af L]1 ' 2 : = {uGCiUC 2 : n (x) = and y G 

for 1 < i < L and Aq^ 1,2 is the complementary set. (Here the 'BL' 
stands for 'block.') Let Q^' 1 ' 2 be the partition given by 

rT[BL]l,2 7[BL]1,2 JpLJl^, 

where 

If L]1 ' 2 := {uj G C x U C 2 : n (x) = and ^(x, z)^ 10 ^ 1 G A { } 
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for 1 < % < L and Ajf^ 1 ' 2 is the complementary set. Similarly for 
j G {1,2}, let Q [BL]i be the partition {Af L]j , Af L]j , . . . , Af L]j } where 

A^ L]j := {cu G Cf. n (x) = and ip{x,z) G A} 

and /1q BL ' ji is the complementary set. 

Finally let pP^ 1 be the partition with elements 

{uj G C\ : x G P and n (x) = 0} , 

where P G V^Lo 1 T~ l V, together with the complement of the union of 
this collection; "p[ BL l 2 be the partition with elements 

{u G C 2 : x G P and n (x) = 0} , 

where P G V/ilo 1 T~ l V, together with the complement; and V° be 
the partition with elements P C\ {u: B (x) (jL Bon G E }, where 
P G V/ilo 1 T~ l V, again along with the complement. 

Let us pause to explain the above notations. The partition C tells 
you whether you are attempting to shadow an element of G determined 
by the dictionary <fi (Ci), a random element of G (C2) or if there is no 
constraint (C ). For the Qi partitions, the superscript 1 indicates that 
one is looking at those times when one is shadowing an element of 
G determined by 0; 2 indicates that one is shadowing an random el- 
ement of G; and 1,2 indicates that one is shadowing either of these 
two. The tildes indicate the partition element that one is aiming for 
(i.e. the partition element that ip(x, z) lies in) rather than the par- 
tition element that y actually ends up lying in. Also the superscript 
'BL' indicates that one is getting a whole block's worth of informa- 
tion at once, whereas otherwise one gets the information a symbol at 
a time. The partition Q 1 / 2 tells you which element of Qf the point y 
ends up in for the parts that are constrained by tp(x, z). The partition 
Qf L]1 ' 2 tells you which element of \J i=M S % Q(. the point y ends 
up in if x is at the base of the Rokhlin tower and is at the start of 
a B block; Q^' 2 tells you which element of \ffS]^ 0M1 S~ % Qt the orbit 
segment you are aiming for ((ip(x, z)i) M<i<N _ WM ) belongs to when x 
is at the base of the Rokhlin tower and" is at the start of a B block. 
The partition Q\ tells you which element of \jfS^ M ~ X S~ l Qi the or- 
bit segment (ip(x, z)i) M<i<N _ WM belongs to when you are shadowing 

an element of G determined by 0; and Qe tells you which element 
of \fiJ^ 0M _1 S~ l Q e the orbit segment (ip(x, z)i) M <i<N~ioM belongs to 
when you are shadowing an random element of G. The partitions p[ BL l J 
(for j = 1, 2) tell you the block Xq -1 when x is at the base of the tower 
and -Bq(^) G B if the dictionary is being used (j = 1) or if the word 
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is being randomized (j = 2). Note that the partition V° tells you the 
symbol x when B (x) ^ B. one E . The partition P VP [BL]1 VP [BL]2 
is a generating partition for fj, 
Let 

E:= {ueO:i^4 n (x) = 0, B (x) 6B,2 / D} . 

Then by f )20|) and Corollary [28] (JCJ), and the independence of \i and (, 
we have t(E) > (l/iV)(l - 5){1 - 155)(e/2), so that by we have 

8iVA • 6(E) > 3eA. (23) 

We also note that Cq is the set of points in Q whose first coordi- 
nate belongs to E U [jf^ 1 t~ % [f fl Ub^b^IJ ■ We therefore calculate 

l(C ) = fi(E Q ) + (1 - fi(Eo))/N ■ N • //(Ub*b[£]) < m - Usin S AH 
we obtain 

t(C ) log |P| < eA (24) 
The following facts will be used to complete the calculation. 

(a) h(r,Qe\K) > K(r, QY\R); 

(b) K(T,Ql' 2 \n)>K(T,Ql' 2 \ny, 

(c) ^(T,Qf|^)>/i t (r,Qt BLll ' 2 |^); 

(d) /i t (r, Qf h]1 > 2 \K) = K(t, Qf L]1 \TZ) + h,(r, Qf h]2 \TZ); 

(e) k(r,Qf L]1 \n) = hfaV^lK); 

(f) /i t (r,Qf L]2 |^)=/i(E)log|G|; 

(g) k{t,v\h) < h L (r,v^n) + h L (T,v^ 2 \n) + K(r,v \n); 

(h) h t (T,V°\K)<i(C )log\V\<eA; 

(i) h,{r,V^ 2 \TZ) < /x(E)log|B|; 
(j) fc t (r,C) < log 3 <eA. 

Notice that (jgj), (jcj) and (jgj) follow from the fact that if V\ and P2 are 
partitions such that o(\J i& T-" l Vi) D T -i P 2 ), then h{r,V x ) > 

h{j ) T > 2)- Facts (jgj) and (jh]) follow from standard entropy results, and 
(f2"l"|) . To see that (jBJ) holds, notice that observing the elements of 7?. 
and Q\' 2 the point w lies in is sufficient to determine which element 
of G was being targeted; this follows from the same reasoning used in 
the second half of the proof of the approximate embedding property. 
That no(x) = and u G C% is a 1Z- measurable event since u points 
stay in C\ for an exact multiple of iV steps. Hence one can deduce the 
partition element of Q e ' that u> belongs to given the sequence of 1Z and 
Qi t 2 partition elements. For (jfj and we work with the information 
functions I L (Qf 2 [R) and I L (V B2 \Tl); these are if u g E, whereas if 
u E, then they are log |G| (since each girl appears independently with 
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equal likelihood) and at most log | B | respectively. To establish (ld|) . it 
suffices to show that h t (Qf L]2 \ Qf L]1 VK) = h L {Qf L]2 \TZ). This follows 
since the second coordinate of Q is independent of the first. Finally, to 
see ([]]), we use Abramov's formula with the induced transformation of 
t to the set 

A := {u e Q : x G E U F} . 

Notice that between visits to A, the system stays entirely in a single 
element of C. Hence we see that h t (r,C) < i(A)log3 < (5 + ^) log 3; 
this is bounded above by eA using ( fl2|) and Lemma [26] (jcj). 

We then have the following calculation. By Corollary [28] ([B]) and 

0, 

log(|G|/|B|) > N(K m (S) - h,(T) - 2A). (25) 
Let h = h n *^(S). We have that 

h>K* {i) (S, Qi) 

> K(r, Q e \K) > h L (r, QY\R) > K(r, Q\' 2 \TZ) (by (jsj) and ©) 

> K(r, Q¥ L]1 ' 2 \K) = h L (r, Qf L]1 \TZ) + K(t, Qf L]2 \TZ) (by ([cj), ©) 
= /i l (r,p[ BL ] 1 |^) + 6(E) log |G| (by (jgj) and ©). 

Hence 

> K(r,V\TZ) - K(r,V lBL]2 \n) - h L (r,V°\n) + l{E) log|G| (by (jgj)) 

> K(t, V) - ^(r, C) - eA + t(E) log(|G|/|B|) (by © and ©) 

> h^T) - 2eA + 8iVA6(E) (by §, ([25]) and (HOD 

> ^(T) (by (J23D). □ 
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